Analyzing Count Min Sketch with Conservative Updates
نویسندگان
چکیده
Count-Min Sketch with Conservative Updates (CMS-CU) is a popular algorithm to approximately count items’ appearances in data stream. Despite CMS-CU’s widespread adoption, the theoretical analysis of its performance still wanting because inherent difficulty. In this paper, we propose novel approach study CMS-CU and derive new upper bounds on both expected value CCDF estimation error under an i.i.d. request process. Our formulas can be successfully employed improved estimates for precision heavy-hitter detection methods configuration rules CMS-CU. The are evaluated synthetic real traces.
منابع مشابه
Count-Min Sketch
DEFINITION The Count-Min (CM) Sketch is a compact summary data structure capable of representing a high-dimensional vector and answering queries on this vector, in particular point queries and dot product queries, with strong accuracy guarantees. Such queries are at the core of many computations, so the structure can be used in order to answer a variety of other queries, such as frequent items ...
متن کاملCount-Min-Log sketch: Approximately counting with approximate counters
Count-Min Sketch [1] is a widely adopted algorithm for approximate event counting in large scale processing. However, the original version of the Count-Min-Sketch (CMS) suffers of some deficiences, especially if one is interested in the low-frequency items, such as in textmining related tasks. Several variants of CMS [5] have been proposed to compensate for the high relative error for low-frequ...
متن کاملCount-Min Tree Sketch: Approximate counting for NLP
The Count-Min Sketch [1] is a widely adopted structure for approximate event counting in large scale processing. In a previous work [7] we improved the original version of the Count-Min-Sketch (CMS) with conservative update using approximate counters [6, 4] instead of linear counters. These structures are computationaly efficient and improve the average relative error (ARE) of a CMS at constant...
متن کاملLossy Conservative Update (LCU) Sketch: Succinct Approximate Count Storage
In this paper, we propose a variant of the conservativeupdate Count-Min sketch to further reduce the overestimation error incurred. Inspired by ideas from lossy counting, we divide a stream of items into multiple windows, and decrement certain counts in the sketch at window boundaries. We refer to this approach as a lossy conservative update (LCU). The reduction in overestimation error of count...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Computer Networks
سال: 2022
ISSN: ['1872-7069', '1389-1286']
DOI: https://doi.org/10.1016/j.comnet.2022.109315